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OBJECTIVE ESTIMATES BASED ON EXPERIMENTAL DATA AND 
INITIAL AND FINAL KNOWLEDGE 
by Burt M. Rosenbaum 
Lewis Research Center 

SUMMARY 

A reformulation of the expression for "entropy" in the presence of experimental 
data enables the determination of "least -biased" probability estimates in the a posteriori 
case for both discrete and continuous sample spaces. The method is an extension of that 
used by Jaynes in 1963 by means of which he estimated least-biased probability values 
for the a priori situation for a discrete sample space where his probability estimates 
objectively took into consideration any prior knowledge that an experimenter might have 
at his disposal. The modification employed herein allows for the adjustment of the in- 
fluence of prior knowledge on the posterior distribution in accordance with the subjective 
degree of belief in the accuracy of the prior knowledge. Also, as part of the modified 
method, any posterior knowledge is treated in the same way as the prior knowledge. 

Thus, the "best" posterior estimates depend on initial or prior information, exper- 
imental data, and final or posterior information. The postulated method agrees with the 
rule of Bayes and affords an insight into the rules of Laplace, namely, the principle of 
insufficient reason and the rule of succession. Several simple examples are treated in 
detail to illustrate the procedure followed. 


INTRODUCTION 

Scientific inference techniques based on Bayesian methods are internally consistent 
(refs. 1 and 2). As soon as a prior distribution has been set up by the statistician, the 
rule of Bayes states how the experimental data modifies the prior distribution to yield a 
unique posterior distribution. 

The difficulty associated with the application of Bayesian concepts to a particular 
problem can be attributed to the arbitrariness of the assumed prior distribution (refs. 3 
and 4). Two statisticians with the same initial knowledge will, in general, not assume 



the same priors and, hence, their posteriors or final distributions will also differ. It 
has been argued that the two posteriors obtained would not be significantly different and, 
in any event, if a prior is smooth, the experimental evidence will eventually overwhelm 
any bias that might exist in the prior; therefore, any prior that is consistent with the 
initial information is satisfactory (ref. 5). However, while this may be the case when 
’’sufficient" data is on hand, the question as to whether a particular prior is or is not 
biased still remained unanswered. 

The establishment of unbiased priors was the problem Jaynes considered (ref. 6). 
Jaynes used the maximum entropy principle employed in information theory (ref. 7) and 
statistical mechanics (refs. 8 and 9) to obtain estimates of prior probabilities. Using 
these estimates, Jaynes calculated expectation values of a loss function associated with 
each allowable decision. The solutions that he obtained showed that the optimum deci- 
sions - those yielding the smallest expected loss values - were the same as one would 
normally choose based on a common sense approach. When the initial knowledge or 
constraints governing a particular decision problem changed so that it was obvious that 
the optimum decision should change, it was found that the maximum entropy concept also 
dictated a like change in the optimum decision. When the problem under consideration 
became so complicated that common sense or intuition could not definitely single jut the 
optimum decision, the mathematical approach based on maximum entropy still was able 
to indicate unambiguously a particular decision as optimum. The major point to be 
noted is that Jaynes had constructed a quantitative method for establishing values of 
prior probabilities that could be said to be objective and "optimally" reasonable. 

Although Jaynes had devised a method for calculating objective optimum values for 
prior probabilities from initial knowledge, these values could not be used to calculate 
unique values of posterior probabilities. In order to use the Bayes rule in going from 
initial to final probabilities, what is needed is the prior probability density function for 
the probabilities themselves, and Jaynes’ method does not generate this function. 

This report extends the method of Jaynes so that probability estimates based on both 
initial information and experimental data may be made. The extension affords a new in- 
terpretation of Laplace's principle of insufficient reason and introduces a weight param- 
eter into the calculation dependent on the statistician's degree of belief in his original 
"guesstimates" . 


SYMBOLS 


a i 


number of times outcome i would be expected to occur based on initial infor- 
mation 


vector { ai ,a 2 ,. . .,a n } 


2 


E [p./D] 
E[X] 

%) 

f(p) 

f(t) 

m 

f(x) 

e k (i) 

<g k > 

H 

h q (i) 

<V 

K 

M 

M 0 

M r 

m i 

in 

n 

n i 

p(d/p) 

p 

p 

Q 

s 

t 


adjustable parameter 

expectation value of p^ based on the posterior probability density function 
expectation of X 

probability density function for p^ 

multivariate probability density function for variables jp^Pg, • • • ,P n j 

probability density function for t 

probability density function for a 

probability density function for x 

function of index i 

expectation of 

modification of entropy function defined in eq. (36) 
function of index i 
expectation of h 

VI 

number of prior constraining equations 
number of actual experimental measurements 
number of initial hypothetical measurements 
number of measurements of runout time 
number of times outcome i occurs 
vector jmp . . -,m n j 
number of possible outcomes 
multiplicity associated with outcome i 

probability of data given that p takes on values |p^,P2> • ■ • ,P n j 
probability 

vector { Pl ,p 2> . . .,p n } 

number of posterior constraining equations 
entropy defined in eq. (4) 

’’entropy” function defined in eq. (35) 
time-to-failure, hr 
times -to -failure, t < t^ 


3 



w 

X 

x 

B(q,r) 

r(q) 

6(x) 

X k 

Mx 

*q 


number of randomly selected white balls 
bounded continuous random variable 
value of X 

Beta function of q and r 

Gamma function of q 

Dirac delta function with argument x 

■Hi 

k Ln Lagrangian multiplier, k / 0 
mean of t 


mean of X 

q** 1 Lagrangian multiplier, q / 0 
alternating fatigue stress 
variance of t 


”X 
Subscripts: 


variance of X 


altered (after incorporation of some of the data) 

final or posterior (after acquisition of data) 

outcome i 

k n prior constraint 

q posterior constraint 
th 

aT outcome of n. possible outcomes 
0 initial or prior (before acquisition of data) 

Superscripts: 

least -biased estimate 

~ objective (not necessarily least-biased) estimate 

average value 


A 

D 

i 

k 

q 

a. 


METHOD OF JAYNES 

We consider an experimental measurement which can take on any one of n 
exclusive distinct results where the possible results are labeled by the numbers 


mutually 


4 


1,2, . . . ,n. Let p., i = 1,2, . . . ,n, denote the probability that the system is in the 
state that yields the i result where it is assumed that the nature of the experiment is 
such that p. is independent of time. We have 

t* i " 1 (1) 

i=l 

Suppose, before any experimental measurements are made, we know or can make a 
guess of the values of the expectations of certain functions of i and that this prior 
knowledge or information may be expressed by K independent relations of the form 

2 P i g k (i) = < g k>’ k = 1 > 2 »- ■ *’ K (2) 

i=l 

where 


K < n - 1 (3) 

According to Jaynes, the least -biased prior probability estimates (pgj , i = 1,2, . . . ,n, 
are those that maximize the entropy 1 


n 

S = - ^ p. In p. (4) 

i=l 

subject to the constraints given by equations (1) and (2). 

Notice that if there is no initial information about the experiment other than the pos- 
sible results, then there are no constraints except that of equation (1), which always 
applies, and the entropy S is greatest when all p i are the same; that is, 

( ; °H (5) 

Hence, in the absence of any information, the least -biased prior probability estimates 
are the same for every possible result. These estimates agree with Laplace's principle 
of insufficient reason, sometimes attributed to Bernoulli, which assumes that each re- 
sult is as likely to be true as any other result unless there exists some reason for as- 
suming otherwise. Jaynes has noted that the maximum entropy criterion leads to prior 
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probability estimates that are as uniform as possible subject to the fact that the con- 
straints must hold. 

In reference 6, Jaynes applies his method to the situation wherein a plant manager 
must decide which single color to paint the day's output of 200 "widgets". This article 
by Jaynes represents a stimulating and plausible argument for the method in general. 


ROWLINSON'S CRITICISM OF JAYNES' METHOD 

Although Jaynes’ arguments appeared convincing to some (e.g. , refs. 10 and 11), 
it was still opposed by the classical statistician for whom the probability of an outcome 
can only be interpreted as the frequency with which it occurs in a given experiment or a 
given idealization of an experiment. From this viewpoint, parameters in a probability 
distribution are constants which cannot be said to possess probability distributions. On 
the other hand, the Bayesian statistician treats any physical parameter about which he 
has less than full knowledge as a random variable where the probability that the param- 
eter under consideration may take on a particular value or lie in a given interval repre- 
sents a person's degree of belief in that happenstance based on what the person knows or 
feels at the time. This latter viewpoint is anathema to the classical statistician. 

Rowlinson (ref. 12) discussed a game of chance in which the score on each turn 
could be any integer from 1 to 6. If it is known that the average score is 4. 5, then, em- 
ploying Jaynes’ analysis, we have 


6 


Z iPi = 4. 5 
i=l 


(6) 


and we can maximize 


S = 


6 

-Zv-pi 


i=l 


(7) 


subject to the constraints given by equations (1) and (6) to yield Jaynes' least -biased es- 
timates ^Pq\ f° r the probability of each integer on any turn. Carrying out this process 
gives 1 



(8) 
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Rowlinson, however, states that there is really no reason for trusting that the prob- 
abilities as generated by the Jaynes' method are actually correct. For instance, he 
says that each turn of the previously mentioned game of chance might consist of noting 
the number W of white balls in a sample of five balls randomly selected (with replace- 
ment) from an urn containing seven white and three black balls, the score i for each 
turn being (W + 1) . For this game, 



7 i_1 0. 3 6_i , 


i = 1,2,. . .,6 


(9) 


The p/s for equations (8) and (9) are as follows: 


Method 

Score, i 


1 

2 

3 

4 

5 

6 


p i 

Equation (8) 

0.055 

0.080 

0.120 

0.165 

0.235 

0.345 

Equation (9) 

.000243 

.02835 

.1323 

.3087 

.36015 

.16807 


It is seen that the two probability distributions are markedly different. On this basis, 
Rowlinson rejects the principle of maximizing the entropy as a "useful way of attacking 
the problem" in the first place. 

The rationale behind Rowlinson' s argument is that taken by the classical statistician. 
Rowlinson throws out the maximum entropy estimates of p i because there are situations 
that exist when these estimates are not correct. In fact, Rowlinson would object to any 
prior estimate of p^ because we can always manufacture games where the prior esti- 
mate would be far from being correct. Indeed, Rowlinson maintains that on the basis of 
the information given, namely, that the score can be any of the integers from 1 to 6 and 
that the average score is 4. 5, there is no means of estimating the p/s. 

The position of Jaynes is that, although the p^'s as given by equation (8) are the 
least biased, any set of p^'s satisfying the constraints might actually be correct. Be- 
cause the average score per turn in the example is 4. 5 and this value is larger than it 


7 



would be if all were equal, one would reasonably expect the p/s associated with the 
larger values of i to be larger than those associated with the smaller i-values. This is 
what the maximum entropy concept objectively accomplishes. Knowing only that the av- 
erage score is 4. 5 does not make the assumption that p g is more than twice as large as 
Pg (as is given by eq. (9)) reasonable. The maximum entropy criterion can be said to 
give equal p/s unless there is a good reason for not doing so and, in the event that 
there is such a reason, the criterion still endeavors to keep the p^s as nearly equal as 
possible while satisfying the constraints imposed by prior knowledge. The more prior 
information that is available, the more one would expect the least -biased probability es- 
timates to be closer to reality. However, in any but the complete information case, the 
chance that these probability estimates are exactly correct is for all practical purposes 
zero because the number of possible distributions is infinite. 

To illustrate that least -biased probability estimates become better approximations 
to correct values as prior knowledge increases, let it be suspected that p^ is twice Pg 
so that the additional constraint that 

P 5 = 2pg (10) 

is imposed. The least -biased probability estimates become 




P0. 


(1.653) 1 


1. 5(2) 1 / 3 (1. 653) 16 / 3 + ^ (1.653)^ 

3=1 


i= M, 3, 4 


H = 


2 1 ^ 3 (1.653) 16 ^ 3 


1. 5(2) 1//3 (1 .653) 16 / 3 + ^(1.653)^ 

3=1 


(ID 


P °'6 2 \ / 5 
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The probability distributions given by equations (8), (9), and (11) are plotted in the fol- 
lowing sketch: 



And we see that, for every value of i except for i = 3, the p^ given by equation (9) is 
closer to the p i of equation (11) than to that of equation (8). 


MODIFICATION OF THE FORM OF THE ENTROPY FUNCTION TO 
INCORPORATE MULTIPLICITY CONSIDERATIONS 


We consider a slightly altered model from the one heretofore considered. Suppose 
each possible experimental measurement labeled i, where i = l,2,. . .,n, arises from 
any of a group of u mutually exclusive distinct outcomes. Let p iQ? represent the 

tVi i 

probability that an experimental result is the a- outcome in the set of n. outcomes 

j_ l 1 

associated with the i possible experimental measurement. We have 



( 12 ) 


where equation (1) still applies so that 



i=l o^=l 


(13) 
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Again, suppose that the prior knowledge is given by equations (2). Then, we can proceed 
with Jaynes' method where the entropy now is given by 


S = 



(14) 


Maximizing S subject to the constraints given by equations (13) and (2) yields the least- 
biased probability estimates 



exp 


K 


X 0 " X/ X k g k^ i) 

k=l 


(15) 


where Xg-ljXjjXg,. • • ,X^ are Lagrangian multipliers whose values are determined by 

substituting equation (15) into the constraining equations (13) and (2). 

From equation (12) and the fact that ( p Q J is independent of a. 

\ u /ia^ 



(16) 


Hence, the maximum value of S subject to the constraints occurs when each of the out- 
comes corresponding to a given value of i has the same probability of happening. Note 
that the value of (pq| as given in equation (16) is modified by the factor n^ where m 

is the multiplicity associated with the i experimental measurement. 

When each of the m outcomes, on = 1, 2, . . . ,m, has the same probability, we 

have 
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and equation (14) takes on the form 


S = 




( 17 ) 


Thus, the least-biased estimates as given by equation (16) result from maximizing 
equation (17) for S subject to the appropriate constraints. Because the multiplicity 
associated with a given experimental measurement can never be neglected, the expres- 
sion for S that applies to every problem should always be that given in equation (17). 
However, in the absence of any information about the multiplicity of the various exper- 
imental results, n^ is taken as unity for each i and the expression for S reverts to 
that given in equation (4) . 

As an example of the application of equation (16) , let us turn once again to the game 
of chance suggested by Rowlinson. For this game, m is the number of ordered ways 
of selecting (i - 1) white balls and (6 - i) black balls so that 


n i = 



1 , 2 ,. 


(18) 


The equation of constraint is equation (6) where g(i) = i. By equation (16) 



"W 


Employing equation (1) to eliminate 



gives 



(19) 


( 20 ) 
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Substituting equation (20) into constraining equation (6) results in 



or, using the binomial theorem, 



4(1 + 



= 4.5?(1 + 4) 5 


( 21 ) 


where 


4 s e 


-A, 


( 22 ) 


We get from equation (21) 


e 


A 


7 

3 


and equation (20) becomes 




(0. 7) i-1 (0. 3) 6_i , i = 1,2,. 


.,6 


(23) 


Hence, the least-biased estimate |p for is identical to the correct values as 
given in equation (9) . 1 

The fact that the least -biased probability estimates are exactly those given by a 
binomial distribution (when the binomial multiplicity or degeneracy of an experimental 
measurement is included in the entropy formulation and the constraining condition fixes 
the true average) was pointed out by Jaynes (ref. 13). Indeed, Jaynes went so far as to 
state "if the experiment fails to confirm the maximum -entropy prediction, and this dis- 
agreement persists on indefinite repetition of the experiment, then we will conclude that 
the physical mechanism of the experiment must contain additional constraints which were 
not taken into account in the maximum -entropy calculation. The observed deviations 
then provide a clue as to the nature of the new constraints". 
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CONSEQUENCE OF THE RULE OF BAYES WHEN NO PRIOR 
INFORMATION IS AVAILABLE 

We now turn our attention to a situation that has been well documented in the litera- 
ture. The problem that will be reviewed in this section will serve as a limiting case for 
the generalized method to be proposed later in this report and, as such, is important in 
providing necessary insight. 

■FVi 

Again the index i denotes the i tn possible result of n mutually exclusive pos- 
sible results and p^ denotes the probability of getting the i n result. We suppose that 
no prior information is available and that, in a total of M repetitive measurements, the 
first result has turned up m^ times, the second result mg times, and so forth. We 
have 


n 

y>. = M (24) 

i=l 

What we wish to calculate on the basis of the data ^mping, . . . > m n j is a set of objec- 
tive (not necessarily least -biased) posterior estimates ^p^ of the probabilities p^. 

The procedure we shall follow is to first set up the prior distribution functior for the 
probabilities, then to employ the rule of Bayes to obtain the posterior distribution func- 
tion for the probabilities, and finally to calculate the desired objective probability esti- 
mates by finding the expectation values of the probabilities based on the posterior distri- 
bution function. 

Because at the outset nothing is known about the occurrence of any of the outcomes 
except for the conditions that p^ > 0, i = 1,2, . . . ,n, and equation (1) holds, all possible 
vectors p = jp^Pg, • • • >P n j are equally likely. In other words, every point in the 

n-dimensional hypercube 


0 < pj ^ 1 

0 ~ P 2 ~ i 


0 


1 
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is as likely to be true as any other point provided both points satisfy equation (1). The 
prior probability density function consistent with this viewpoint is 

f 0 (?) = V p l’ p 2’ * * • > p n ^ = c (25) 

for all points on the surface S where S is given by the equation 

n 

i=l 

The value of the normalization constant c must be such that integration of the proba- 
bility density function over the (n-1) -dimensional surface S gives unity; that is, 



(26) 

The multiple integral in equation (26) may be evaluated by using the Beta function 
identity (ref. 14): 


B(q, r) 



ft(l - t) r dt = r & t - 1)r < r + *> ; 

T(q + r + 2) 


q,r > -i 


(27) 


Here T(q + 1) denotes the Gamma function with agrument (q + 1). Changing the variable 
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of integration in equation (27) from t to x = at, we get 



x q (a - x) r dx = 


r(q + l)r(r + 1) 
r(q + r + 2) 


a q+r+l 


(28) 


We now can easily show that the integration of the multiple integral in equation (26) 
can be carried out as a succession of integrals of the type appearing in equation (28) . 
Performing the integration over P n _i (corresponds to q = 0, r = 0, a = 

1 - Pi - $2 ~ ' ' - " p n-2^ leads to the following integral taken over P n _ 2 : 



l-P r P 2 -...-P n _ 3 

(1 - Pl - P 2 - • • • - p n -2 )dp n-2 


This integral is of the form given in equation (28) with q = 0, r = 1, and 


a = 1 - Pj - p 2 - . . . - P n _g 

Hence, application of equation (28) leads to the following integral now taken over p„ 

n-o 

■i- Pl -p 2 -...- Pn _ 4 

0 


r(i)r( 2 ) 
r(3) 


/’ 


(i - Pj - p 2 - 


p n-3 } dp n-3 


which again can be evaluated by applying equation (28). Carrying out the (n - 1) inte- 
grations in succession yields the value (n - 1)'. / fr for the normalization constant c 
so that 


r 


fn(P) = < 


‘ for all points on S 


elsewhere 


is the prior probability density function for the probabilities p^ 


(29) 
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This distribution f Q (p) can be modified by using the rule of Bayes which takes into 
account the experimental data jm l5 m 2 , . . . ,m n J. The rule of Bayes is given by the 
expression 


f D (P) - 


f 0 (p)P(D/p) 


f 0 (p)P(D/p) 


dS 


(30) 


where 

fj)(p) posterior probability density function for the probabilities 
fg(p) prior probability density function for the probabilities 

P(D/p) probability of the data |m^ , m 2 , . . m n | given that p takes on the set of 

values [p 1? p 2 , • • • ,P n } 

The expression for the P(D/p) is merely the multinomial probability distribution: 


P(D/p) = 





(31) 


while that for fg(p) is given'by equation (29). The denominator of the right side of 
equation (30) can be evaluated in the same way as was done for the multiple integral of 
equation (26) . We obtain 


f D (p) 



(M + n - 1)1 

m,! m 0 '. . . . m 
l l n 


m 1 m 9 

p 2 lp 2 



(32) 


on surface S. 

The posterior probability density for the probability p i alone may be obtained by 
integrating over the other probabilities: 


f D (Pi) 


if ' l f D (?)V ^ dP l dp 2 • • * dp i-l dp i + l ’ ’ • dp n-l 


= (M + n - 1)1 p 
m.'. (M - im + n - 2)'. * 


M-m.+n-2 

P t ) , 0 < Pi < 1 


(33) 
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The expectation value of p^ based on the posterior probability density function as given 
by equation (33) is our desired objective estimate ^p^ of p^. The estimate is then 

~ f 1 m. + 1 

(PD).= E (Pi/ D )= / Pi f D^ d Pi = (34) 

1 Jo 

This relation, which holds when the prior probabilities are uniformly distributed, is 
known as Laplace’s rule of succession (refs. 15 and 16). Thus we can say that according 
to the Bayes-rule estimate the probability that the next measurement will be the i n re- 
sult is (m^ + 1)/(M + n). 

This answer differs from that given by the maximum -likelihood estimate of p^ 
which is im/M. We note that the maximum -likelihood estimate of p^ would be iden- 
tical to the Bayes-rule estimate if, in the maximum -likelihood case, we supposed that, 
before any actual measurements were made, there were n hypothetical measurements 
wherein each of the possible outcomes turned up exactly once. 


MODIFICATION OF THE EXPRESSION FOR ENTROPY WITH THE ACQUISTION 
OF DATA IN THE ABSENCE OF PRIOR INFORMATION 


Equation (34) has shown that when we start with a uniform distribution for the prob- 
abilities p the use of Bayes' rule to incorporate the data jm^nig, . . . , m n j leads to 

the objective estimate (rm + 1)/(M + n) for p^. Now we remark that application of equa- 
tions (16) and (17) with n^ replaced by (m^ + 1) demonstrates that the same result could 
be obtained by finding those p/s that maximize the expression. 



(35) 


subject, of course, only to the constraint given by equation (1). Hence, this 
modification of the entropy expression should be examined in order to see whether it can 
be interpreted in any sensible way. 

First, equation (35) says that the entropy formulation changes with the accumu- 
lation of experimental data. If we consider the term (m. + 1) as the number of measure- 

i.U 1 

ments resulting in the i Ln outcome, then the initial state of no data can be interpreted 
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as starting out with a total of n fictitious measurements, one each for every possible 
outcome i. But this is the same correspondence that we noted in the last section in 
order to obtain agreement between the maximum-likelihood estimate and the Bayes-rule 
estimate. 

Second, we see that as the number M of measurements increases, the least -biased 
estimate of p. based on equation (35) more and more approximates the frequency with 

j.u 1 / +* \ 

which the i tn result occurs; that is, IpjJ — nr/M as both m^ and M become large 

compared to n. Obviously this should be the case. 

Thus the reformulation Sp of the expression for entropy in the presence of data 
appears to yield sensible answers for the limiting cases of M = 0 and M— *>. Also, 
the theory as based on equation (35) illustrates that Laplace's principle of insufficient 
reason and Laplace's rule of succession now rest on a common footing; namely, the 
initial state corresponding to the situation where no prior knowledge exists consists of 
the assumption that n measurements have been made, each possible outcome of the 
n possible outcomes turning up exactly once. Hence, this is equivalent to starting out 
with the assumption that all outcomes are equally likely. 


CONSIDERATION OF THE PRIOR DISTRIBUTION WHEN PRIOR 
INFORMATION IS AVAILABLE 

To repeat, we were able to interpret the term (m. + 1) appearing in equation (35), 

7 1 j-i- 

the 1 being the number of hypothetical measurements yielding the i result for the 
case where no prior knowledge is on hand and the m. being the actual number of mea- 
surements yielding the l result. By contrast, when we have some initial information, 

then, by applying Jaynes' method, we can obtain least -biased prior estimates of P 

and no longer would it be "reasonable” to assume a priori the same number of hypo- 
thetical measurements for each of the outcomes. Instead, we can generalize the expres- 
sion given in equation (35) by writing 


H = 




m. + a. 

l i 


i=l 


(36) 


where a^ is the number of times that outcome i would be expected to occur based on 
our initial information and Jaynes' method, where equation (17) is employed as the ex- 
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pression for entropy. Thus, we let a^ be given as 

a i = ^ n (p 0 ). (37) 

where is an adjustable parameter that plays the role of the flattening constant 
(ref. 16). We note that equation (36) reverts to equation (35) when no prior information 
is available and - 1 . 

By equation (37), the number of hypothetical prior measurements is 

M q = <<fn (38) 

so that equation (37) takes the form 

a i = m o(po\ < 39 > 

It may be noted that the larger the value of ^ or Mg the more data is needed to sig- 
nificantly alter the original hypothetical distribution a = ja-pag, . . . > a n j- The greater 

the degree of credibility in the initial information, the larger the value should 
take on. Caution must be exercised in this regard because the vector a corresponds to 
a complete specification of the distribution and this distribution, even though it repre- 
sents the least -biased distribution based on initially known true values of averages, 
could be very far from the appropriate distribution. There might be some cases when it 
is justifiable to take <4 as large as 3 or 4. Even zero might be chosen for <<f if 
it is desired to obtain the maximum likelihood estimate of p. 

At this point, the concepts associated with the generalized method by which we can 
proceed have been completely established. If there is some initial knowledge as given by 

the constraints, then, by using Jaynes’ method, the least -biased prior estimate (^oj> as 

given by equation (16), is set up. The hypothetical distribution a is found and inserted 
into the expression for H given in equation (36) by using equation (37) (or by eq. (39)) 
and our degree of credibility as characterized by <€ (or M Q ) . If the experimental data 
jmpmg,. . . ,m n j are now incorporated into the problem, then H may be maximized 

with respect to p subject to any constraints that may apply a posteriori. If these 
posterior constraints are written as 

n 

£p.h q (i) = <h q >, q = 1,2,. . .,Q (40) 

i=l 
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then the expression for 



may be given by 



-v 


0 


= (mu + a^e 


Q 

- £ v h (i) 
Lj q q 
q=l 


( 41 ) 


where Vq-1,v^, . . . , Vq are Lagrangian multipliers. Thus, this procedure yields 

least -biased estimates ^Pgj , i = 1,2, . . . ,n, which take into account both prior and 

posterior information as well as experimental data. 

Note that maximization of H with respect to p when all nn = 0 yields the least- 
biased estimates 


4^j 

] 

so that, in the limit where all m i = 0, the generalized method still gives the same values 
for the least -biased estimates as those given by the Jaynes' method. 

It may be observed at this point that the method as stated may be employed for the 
case of continuous random variables with very little change in viewpoint. This fact is 
shown in the subsequent examples illustrating the generalized method. 



EXAMPLE INVOLVING DISCRETE VARIABLE 

We consider a simple example to illustrate the method. Let the number n of pos- 
sible outcomes be 3 with i = 1, 2, or 3 designating the three possible outcomes. Let 
the prior information be given by 

3 

£iPi = !.5 (42) 

i=l 

Also, the multiplicity of outcome 3 is believed to be twice that of either outcome 1 or 2. 
Hence, n. = rig = 1 and n^ = 2. 
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Applying Jaynes' method, we find that the least-biased prior probability estimates 

are 




= < 


0.643, 

0.214, 

0.143, 


i = 1 
i = 2 
i = 3 


Taking = 1 , we write H as 

3 



i=l 


Let us assume that six experimental measurements have been made and the data is 
in = ^3, 1, 2 J; that is, outcome i = 1 has occurred three times, outcome i = 2 once, 
and outcome i = 3 twice. Then 


m. + a. 
1 x 



i = 1 
i = 2 
i = 3 


We shall obtain objective posterior probability estimates for two cases: 

(1) Where the constraint as given by equation (42) no longer applies (This is the 
situation that occurs when the constraint represented an initial guess and we want to re- 
lax this constraint a posteriori because the guess might well prove to be wrong. ) 

(2) Where the constraint as given by equation (42) still applies (In this case, we 
know definitely that this relation holds and all probability estimates, whether prior or 
posterior, must conform to this relation.) 

We obtain the following posterior probability estimates: 
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Case 

Outcome, 

i 

Constraints 

1 

2 

3 

1 

0.548 

0.182 

0.270 

3 

E*- 1 

i=l 

2 

.670 

.160 

.170 

3 3 

2>i = l ’ Z ip i = 1 - 5 

i=l i=l 

[ 


We see that for the situation where the constraint that the expectation of i remains 
at 1. 5 still holds (i.e. , case 2), the constraint acts to increase the posterior probability 
values for small values of i and lower those for larger values of i. In case 1, the in- 
clusion of the experimental measurements coupled with a relaxation of the constraint of 
equation (42) served to increase the estimate of the expectation of i from 1. 5 at the 
outset to the value of 1. 722. 

APPLICATION TO CONTINUOUS RANDOM VARIABLE 

We wish to modify the generalized method so that it applies to a continuous random 
variable. Let the random variable we are considering be denoted by X and let f(x) 
denote the probability density function of X so that f(x)dx designates the probability 
that X lies in dx at x, that is, the probability that x < X < x + dx. 

To establish the least -biased prior probability density function fg(x) for X, 

Jaynes' method is used as previously described but, inasmuch as the random variable 
is now continuous where before it was discrete, the sums become integrals. Hence, 
f q(x) is found by the calculus of variations technique as that function which maximizes 

S = - f(x) In f(x)dx (43) 


subject to the constraint 


j~oo 

L o f < x)dx = 1 (44) 


that always holds and any other constraints written as 
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X gfcto*W*E = <gfc> , k = 1, 2, . . . , K 


(45) 


that would express the extent of the experimenter's prior knowledge. The result of the 
maximization is given by 


f q(x) = ex P 


K 


" X 0 "Z X k% (x ) 

k=l 


(46) 


where Xq-1,Xj,X 2,. • • are Lagrangian multipliers. 

Obviously, once the prior distribution fg(x) has been established, the initial num- 
ber Mq of initial hypothetical measurements can be chosen in accordance with our de- 
gree of credibility. Thus, the prior distribution fp(x) can be said to be based on Mq 
measurements. 

Now, we arrive at the problem of incorporating in our distribution any measure- 
ments of X that have been made as well as any posterior constraints that apply. This 
problem can easily be resolved if we look at equation (41). There we see that, in the 
absence of any constraints except that of equation (1) , ^p^J is proportional to the sum 

of the number of initial hypothetical measurements that have yielded outcome i and 
number of actual measurements that have yielded outcome i. The posterior constraints 
merely serve to modify this sum. Therefore, the corresponding expression for the 
least -biased posterior probability density function fp(x) of the continuous random var- 
iable X must be 



M 


Q 

f D (x) = 

M Q f 0 (x) +Z 6 (x " x i> 

exp 

•"o - L W*' 


i=l 


q=l 


(47) 


where the data |xj ,Xg, . . .x m j are the M values that have been observed in M mea- 
surements of X and 6(x - x^) is the Dirac delta function with argument (x - x.). In 
equation (47), i>q-1 > v \> v 2> ' ‘ ' ,V Q are Lagrangian multipliers and the posterior con- 
strains that apply are 


/ h q (x)f D (x)dx = (h q ) , q = 1, 2, . . . , Q 


(48) 
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It is to be noted that, although fj-j(x) as written appears to describe a variable of the 
so-called mixed type, partly discrete and partly continuous, it is not claimed that X is 
such a variable. Rather it is claimed that f j-j(x) is an unbiased probability density 
function for X based on incomplete knowledge about the distribution of the variable X 

a 

and that from f ^(x) one can derive unbiased estimates for the expectations of statistics 
involving X. It might be possible to devise an objective method for incorporating the 
measured values of X in the probability density function f ^(x) in such a way that, say, 
f q(x) and its derivative remain continuous throughout the domain of x but such a method 
will not be attempted herein. 


EXAMPLES INVOLVING CONTINUOUS VARIABLE 
Bounded Continuous Variable 


In this first example, we consider a simple case wherein the sample space for the 
continuous variable X is the open interval (0,1). The prior information is given as 



(49) 


Then, by equation (46), the least-biased prior probability density function is of the form 


f Q (x) = e 


"V X l x 


where Aq and A^ are constants whose values are determined by equations (44) and (49). 
We obtain 


f Q (x) = 1 . 74 e -1 * 23x 0 < x < 1 (50) 

We take Mq to be 4; that is, we assume the prior distribution is worth 4 measurements. 

Now, suppose two independent measurements of X were made and the values ob- 
served were x^ = 0. 817 and Xg = 0. 574. Let us consider two cases: 

(1) There are no posterior constraints. We wish to obtain least -biased estimates of 
E[X], E[X 2 ], and the probability that X < 0. 5. 

(2) The constraint given by equation (49) still applies a posteriori. This means that 
E[X], the expectation value of X, remains fixed. We wish to obtain least -biased esti- 
mates of P(X > 0. 5) and E[X 2 ]. 
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Based on the prior distribution fpCx) given in equation (50), we get 


P 0 (X > 0 


•1 ~ 


.5) = /* f n (x)dx = 0.351 

* 4/2 ° 


! o P 2 ]"/ 1 

y 0 


E Q [X 2 ] = / x 2 f Q (x)dx = 0. 237 


The results for the two cases are as follows: 


Case (1): 


f D (x) = I [4(1. 74)e -1, 23x + 6(x - 0. 817) + 6(x - 0. 574)] 

P n (X > 0. 5) = f 1 f n (x)dx = 0. 567 
u /o . 


(50a) 


(50b) 


0 < x < 1 (51) 


(51a) 


Ep[X] = f x fp(x)dx = 0. 499 

Jo 

E D [X 2 ] = f x 2 fj-j(x)dx = 0. 324 

J n 


(51b) 


(51c) 


Case (2): 


f D (x) = 2.06 e -2 ' 51x + 0.296 e -1 ‘ 28x [6(x - 0.817) + 6(x - 0.574)] 0 <x < 1 (52) 


- /" 1 - 

P D (X > 0. 5) = / f D (x)dx = 0.413 

J 1/2 

E d [X 2 ] =/ 1 x 2 f D (x)dx = 0. 236 


(52a) 


(52b) 
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Now, suppose that one were interested in establishing a least -biased estimate for 
2 

the variance cr x of X for the prior and posterior situations for each of the previous 
two cases. In order to arrive at these estimates, we need to find out just how our prior 
and posterior knowledge arises. The reason that such knowledge is required is shown 
by the following. We have 


/s 

at ! = E 


(X - p x ) 2 l = E j [(X - E[X]) + (E[X] - p x ) 


= E [X 2 ] - (E[X]) 2 + E 


(i[x] - 


(53) 


where the variance of X is denoted by ct x and the mean of X by p x . Hence, in 

order to determine ct x , we not only need to know E[X] and E[X ] but also an estimate 
for the last term on the right side of equation (53) which depends on how close E[X] is to 
the true mean p x of X. Because the constraint (eq. (49)) sets the value for E[X], we 
have to know the reasons underlying equation (49) in order to estimate ct x . 

Let us suppose that, in case (1) at the outset, the prior information consists of 
knowing only the average value 0. 4 of 4 independent measurements of X but nothing 
else about these measurements. Then 


E Q [X] = 0. 4 

represents an estimate based on Mq = 4 measurements and we can write 


E 


(e[xj - p x y 




Substituting this relation into equation (53), we get for case (1) where we have employed 
the values in equations (49) and (50b) 



E 0 [X 2 ] - (E 0 [X]) 2 0.237 .(0.4)2 



0. 103 


In addition, the estimate E D [X] as given by equation (51b) is based on M + M Q = 6 
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measurements. Therefore, by employing the values in equations (51b) and (51c), we find 


(4) d .!s g !i i l!g ^ .o. 09 o 

6 

On the other hand, suppose, in case (2), the prior information as given by equa- 
tion (49) represents the average of a very large number of measurements so that we can 
consider 0. 4 as being very close to the true mean ja^. Then the last term in equa- 
tion (53) is essentially zero so that 

0 =E 0 [X 2 ]-(e 0 [X]) 2 = 0.077 



and 



= E d [X 2 ] - (e d [X ]) 2 = 0. 236 - (0. 4) 2 


0.076 


It should be fairly obvious how to treat an intermediate situation where the value of 
the average of X as given in equation (49) is based on, say, 20 measurements. In this 
instance, the posterior constraint would no longer be equation (49) because, after mea- 
surements of 0. 817 and 0. 574 for X, the value of the average would change from 0. 4 to 

20x0. 4 + 0. 817 + 0. 574 _ Q ^7 
22 


Hence, in this instance, the posterior constraint would become 



x f D (x)dx = 0. 427 


which change would in turn modify fjj(x). Also, the value of 
sidered as based on 22 measurements of X when calculating 


Ep[X] would now be con- 
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Time-to-Failure 


Here we are interested in the time-to-failure of a given part subjected to a pre- 
scribed loading when the material is manufactured according to certain specifications. 
Suppose we have reason to guess initially that the average time-to-failure is about 
10 hours and suppose that five measurements were made, in three of which the failure 
times were 7, 8, and 12 hours and in the remaining two, for one reason or another, the 
experiment was stopped before failure at the runout times of 5 and 10 hours. What we 
wish to find, on the basis of what has been given, is an objective estimate of the expec- 
tations of the first and second moments of the time-to-failure plus, say, an estimate of 
the probability that the survival time is larger than 10 hours. We shall arrive at esti- 
mates for three cases, namely, when the initial guess is equivalent to M Q = 1, 2, or 
5 measurements. 

We first have to find the least -biased prior distribution for the time-to-failure. 
Letting f(t)dt be the probability that failure occurs at time t in dt (t in units of hr) we 
can maximize 


S = - /*°°f(t) In f (t) dt 

J 0 

with respect to the probability density function f(t) subject to the constraints 

/•OO fOO 

/ f(t)dt = 1, / t f(t)dt = 10 
J 0 ■'0 

to find the least -biased estimate for the prior probability density function 

f 0 (t) = 0.1e' 0,lt (54) 

According to the least -biased prior probability distribution, the probability that the 
time-to-failure t lies between times t & and t^ is equal to 

-0. It -0. It 

P 0 (t a <t<t b ) = e “ e (55) 

The average time-to-failure for failures occurring between times t and t^ is for the 
prior distribution 
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(56) 


E oM 


(a, b) 


= 10 


-0. It -0. It, 

(1 + 0. lt a )e a - (1 + 0. lye b 

-0. It -0. It, ~ 

e a -e ^ 


The average square of the time-to-failure for failures occurring between times t and 
*b ls 


S ot‘ 2 l 


10 " 


(0.1t a )^ + 2(0.1t a ) + 2 


n -0. It 


- (0.1t b ) 2 + 2(0.1t b ) + 2je °' ltb | 


(a,b) 


-0. It -0. It, 
e a - e ” 


(57) 


Also the expectation of the square of the time-to-failure for the least-biased prior dis- 
tribution is 


E Q [t 2 ] = 200 hr 2 

Let us follow the method in detail for the case Mq = 5. It is convenient to consider 
the two runout measurement times of 5 and 10 hours as dividing the time-to-failure axis 
into the three intervals: 

I: 0 < t < 5 hr 
II: 5 hr < t ^ 10 hr 
IH: t > 10 hr 

Then, by equation (55), we can find the prior probability estimate of failure occurring in 
each of these intervals. Also, we can make use of equations (56) and (57) and compile a 
table based on the prior probability estimates: 



Interval 

I 

II 

III 

m o(po\ 

1. 97 

1. 19 

00 
*— i 




2.29 

7. 29 

20 


7.31 

55.2 

499 


29 




Thus, before measurements begin, on the basis of our initial knowledge and the as- 
sumption that M n = 5, there are 1.97 measurements in interval I with an average 

2 2 

t-measurement of 2.29 hours and an average t -measurement of 7.31 hour , 1.19 mea- 
surements in interval II with an average t-measurement of 7. 29 hours and an average 
2 2 

t -measurement of 55.2 hour , and 1. 84 measurements in interval III with an average 

2 2 
t-measurement of 20 hours and an average t -measurement of 499 hour . 

These figures can first be altered so as to incorporate the three time -to -failure 

measurements of 7, 8, and 12 hours. The altered probability density function is 


f A (t) = -[s f 0 (t) + 6(t - 7) + 5(t - 8) + 6(t - 12)] 
8 


(58) 


and the table based on this altered probability distribution is 



Interval 


I 

II 

III 

Number of 
measurements 

1.97 

3. 19 

2. 84 


2.29 

7. 43 

17. 18 


7.31 

56.05 

375.0 


These tabulated values may be calculated in an obvious way using the values from the 
table based on fg(t) . For example, the 7-hour and 8-hour failure times fall in inter- 
val II so that 


g rj. j\ _ 1. 19x7. 29 + 7+8 _ 
' A OI 3.19 



1.19x55.2 + (7) 2 + (8) 2 
3.19 


56.05 


The number of measurements in each interval now have to be changed in accordance 
with the two runout time measurements of 5 and 10 hours. The 10-hour measurement 
increases by one the number of measurements in interval III to 3. 84 whereas the 5-hour 
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measurement is to be distributed over intervals II and m. If we use the numbers in in- 
tervals II and III to represent a relative probability of a measurement arriving there, 
then the fraction of the runout time 5-hour measurement ascribed to interval II is 
(3. 19)/(3. 19 + 3. 84) = 0. 454 while the fraction ascribed to interval III is 
(3. 84)/(3. 10 + 3. 84) = 0. 546. Hence, our table now becomes 



Interval 


1 

n 

III 

Number of 
measurements 

1.97 

3.644 

4.386 

64 

.197 

. 3644 

.4386 

(SdN). 

2. 29 

7.43 

17. 18 

(v t2 i). 

7. 31 

56.05 

375.0 


where 



(number of measurements in interval i) 
M q + M 


so that, for Mq = 5, 



(number of measurements in interval i) 
10 


The same kind of reasoning also gives the least -biased estimate of the posterior prob- 
ability density function as 




L°- 


5 f Q (t), 0 < t < 5 
114 [5 f Q (t) + 6(t - 
155 [5 f Q (t) + 6(t - 


7) + 6(t - 8)] 
12)], t > 10 


5 < t < 10 


(59) 
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Thus, for Mq = 5, we have 


S dM -Jjfoj), 

i 


- (0. 197)(2. 29) + (0. 3644)(7. 43) + (0. 4386)(17. 18) = 10. 70 hr 


and 


E D f t2 ] (E D [ t2 l\ = 186. 5 hr 2 

i 

Also, an objective estimate of the probability that a randomly selected part will last at 
least 10 hours is given by 



0.44 


These values enable us to obtain a posteriori an estimate for the variance of the 
time-to-failure t. Again, we resort to equation (53) where X— t and see that we have 

7* \2l 

I E[t] - fj.^ ) under the conditions that E[t] 


to arrive at an estimate of the value of E 


was determined by a total of M + Mq = 8 measurements of the time-to-failure t and 
M r = 2 measurements of runout times. Obviously, a measurement of runout time would 
not be expected to be as effective as a measurement of the time-to-failure t in fixing 
the value of E[t] close to (if 

As shown in any textbook treating stratified sampling, the variance of t can be 
written 




(60) 


where 



probability that t lies in interval i 

variance of t if t is constrained to lie in interval i 

mean of t if t is constrained to lie in interval i 
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It can be demonstrated that the variance of an average value t of t based on (M + M n ) 
measurements of t and M r measurements of runout times is given by the formula 





M + M r 


M + M Q + M r 


(61) 


provided the class boundaries of the intervals are the runout time measurements. Equa- 
tion (61) can be written 


2 a t 
cr_ = — 


t M + M Q (M + M Q )(M + M q 


— 7 Y Pi T(Pt) " Ptf (62) 

M + M ft + MJ 1 L\ t 'i 71 


Now 


/\ 

s E 


(E[‘) - Cl) 2 ] 


(63) 


so that applying equation (53) we obtain the desired relation 




E[t 2 ] - (E[t]f - 


M„ 


(M + M q )(M + M Q + M r ) Z, 


[(=i4 - =t*]] s 


M + M 


0 


(64) 


Using the values obtained for the Mq = 5 case, we get 

(a?) = — — /l86. 5 - (10. 70) 2 - [o. 197(2. 29 - 10. 70) 2 + 0. 3644(7. 43 - 10. 10)‘ 

\ Vd , 1 8(10) L 


1 - - 
8 


+ 0.4386(17.18 - 10. 70)' 


= 81.3 hr" 
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A tabulation of least -biased estimates of the various quantities is given in the fol- 
lowing: 


M 0 

P 

E[t] 

i[t 2 ] 

E[t] 

E[t 2 ] 

a t 

I 

II 

III 

I 

II 

III 

I 

II 

III 

(Prior) 

0.3935 

0.2386 

0.3676 

2. 

29 

7.29 

20 

7. 

31 

55.2 

499 

10 

200 

— 

5 

. 1968 

. 3644 

.4386 



7.43 

17. 18 



56.05 

375 

10. 70 

186. 5 

81.3 

2 

. 1124 

.4217 

.4659 



7. 465 

15.38 



56.2 

295 

10. 57 

162 

61.2 

1 

.0656 

. 4541 

.4803 



7. 475 

14.15 

1 


56.25 

240 

10.35 

141 

43.5 


Fatigue Stress Distribution Based on Runout Data Alone 

In this example, we will consider the same problem treated by Shah (ref. 17). He 

was interested in determining an objective estimate of the probability that a certain 

7 

manufactured part will last longer than 10 cycles under an alternating fatigue stress 

O 

load of 140. 1x10 newtons per square meter (20 325 psi). He had the results of four 

fatigue tests run at different values of the alternating stress o . In all four of these 

7 

tests, the part did not fail in 10 cycles even though the stress loading in each test was 

3 

much larger than 140.1x10 newtons per square meter. The values of the alternating 

stresses for the four tests were 266. 5x10^, 296.2x10^, 377.4X10^, and 247. 9x10 ^ newtons 

per square meter (38 650, 42 960, 54 730, and 35 960 psi). In his paper, Shah used the 

value of 284.6x10 newtons per square meter (41 270 psi) as his prior estimate for the 

7 

average stress at which failure at 10 cycles occurs. 

Let f(a)da be interpreted as the probability that the alternating stress a in da 

7 

will cause fatigue failure in a randomly selected manufactured part after exactly 10 cy- 

7 

cles. Hence the probability that a part will last longer than 10 cycles under a stress 
load of o' is given by the integral 


/■ 00 

/ f(a)da 

•S rc T 


which is also the probability that a stress larger than o' will be needed to cause 

7 

breakage at exactly 10 cycles. 

We handle this problem in the same way as we did the previous problem. The 
a -axis is divided into six intervals where the runout stress load measurements and the 
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stress we are interested in serve as the boundaries. Thus, the six intervals are the 
following: 

I: 0 ^ cr < 140. 1X10 3 N/m 2 

n: 140.1X10 3 N/m 2 < a < 247.9X10 3 N/m 2 

in: 247.9X10 3 N/m 2 < cr < 266. 5xl0 3 N/m 2 

IV: 266. 5X10 3 N/m 2 < a < 296.2X10 3 N/m 2 

V: 296. 2X10 3 N/m 2 < a < 377. 4xl0 3 N/m 2 

VI: o > 377. 4x10 3 N/m 2 

The least -biased probability density function for the stresses at which the part lasts 

7 — 3 

exactly 10 cycles based on the initial guess that (Tq = 284.6x10 newtons per square 

meter (41 270 psi) is of the same form as that given in equation (39) for the time-to- 

failure 


- i -cr/<r n 

f 0 (a) = ^ C 0 (65) 

CT 0 

and again, as we did before, we shall carry out the calculations for several different 
values of Mq; in this problem, let us take the initial guess as worth Mq = 1, 2, 4, or 
10 measurements. 

The fact that all the measurements are runout stresses means that the estimated 
average value of the stress in each of the intervals remains constant as the measure- 
ments proceed. Only the probability distribution over the intervals changes. The re- 
sults of the calculations are tabulated as follows: 




Interval 



I 

u 

III 

IV 

V 

VI 

n 

Estimated average stress for interval, E[cr], N/m 

64. 5X10 3 

190. 0X1 0 3 

256. 8x1 0 3 

280.6X10 3 

335. lxlO 3 

661.9X10 3 
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* 



* 

M q 

Pi 

Pn 

Pm 

Piv 

p v 

Pyi 

E[a] 

Prior probabilities 


0.389 

0.192 

0.027 

0.0385 

0.088 

0.2655 

284.6X10 3 



^ro 

= 266. 5X10 3 N/m 2 

(38 650 psi) 


10 

0. 354 

0.175 

0.0246 

0.0439 

0. 100 

0.303 

308. 8X10 3 

4 

.311 

.154 

.0216 

.0505 

. 115 

.348 

337.9 

2 

.259 

. 128 

.018 

.0584 

. 134 

. 403 

373.6 

1 

. 1945 

.096 

.0135 

.0684 

. 1562 

. 4714 

417.8 


CT ro = 296 ’ 2xl ° 3 N/m2 < 42 960 psi) + 

previous 1 

10 

0.324 

0. 160 

0.0225 

0.0386 

0. 113 

0.342 

331.9X10 3 

4 

.259 

. 128 

.018 

.0357 

. 139 

. 420 

380.2 

2 

. 195 

.096 

.0135 

.0300 

. 166 

. 500 

429.3 

1 

. 1297 

.064 

.0090 

.022 

. 193 

. 582 

479. 1 


a ro = 377 - 4X1 ° 3 N/m2 < 54 730 psi) + 

previous 2 

10 

0.299 

0. 148 

0.0208 

0.0346 

0. 105 

0.393 

357.6X10 3 

4 

.222 

.110 

.0154 

.0281 

.0910 

. 533 

430. 7 

2 

.156 

.0768 

.0108 

.0209 

.0759 

.660 

495.9 

1 

.097 

.048 

. 00675 

.0137 

.0543 

.780 

555. 5 


a rQ = 247.9X10 3 N/m 2 (35 960 psi) + 

previous 3 

10 

0.278 

0.137 

0.0220 

0.0366 

0. 102 

0.424 

375. 0x10 3 

4 

. 195 

.096 

.0164 

.0299 

.0967 

. 567 

450. 8 

2 

.1297 

.064 

.0113 

.0220 

.0797 

.693 

515. 5 

1 

.0778 

.0384 

.00698 

.0141 

.0561 

. 807 

570.9 


7 

The probability that the part lasts more than 10 cycles under an alternating 

stress of 140.1X10 3 newtons per square meter (20 325 psi) is plotted in figure 1(a) as a 

function of the number of measurements and M n . The curves satisfy a simple relation 

u 3 

because none of the runout stresses were less than 140.1x10 newtons per square meter. 
Hence, no portion of any of the four measurements is allotted to interval I and 



M o( P o)j 0. 389 M q 
M q + M M Q + M 


where M is the number of actual measurements. Obviously the estimated probability 
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(a) Estimate of probability of surviving 10 7 cycles at stress level of 
140. lxHr newtons per square meter (20 325 psi). 


640X10 3 



Run-out stresses, 
N/m 2 

1: 2G6.5X10 3 
2: 296. 2xl0 3 
3: 377.4X10 3 
4: 247.9X10 3 


(b) Estimate of average stress for which failure occurs at 10 7 cycles. 


Figure 1. - Fatigue stress example. 


7 3 

that the part lasts longer than 10 cycles under a stress of 140. 1x10 newtons per 
square meter is 


1 - 



M + 0.611 M Q 
M Q + M 


In figure 1(b), a posterior estimate of the average stress for which failure occurs at 
10 cycles is plotted as a function of the number of measurements and the value of Mq. 
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CONCLUDING REMARKS 


A method has been devised that enables the calculation of objective and reasonable 
posterior probability estimates for both discrete and continuous sample spaces. It has 
been shown that the method agrees with the rule of Bayes and provides a simple inter- 
pretation of Laplace's "principle of insufficient reason" and "rule of succession". 

The procedure is based on a reformulation of the expression for entropy first sug- 
gested by Jaynes for arriving at "least -biased" probability estimates and extends 
Jaynes' reasoning to the situation wherein experimental data is on hand in addition to any 
constraints that may apply a posteriori. 

Lewis Research Center, 

National Aeronautics and Space Administration, 

Cleveland, Ohio, October 28, 1971, 

132-15. 
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